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Abstract — Recently, Tarokh and others have raised the 
possibility that a cognitive radio might know the interfer- 
ence signal being transmitted by a strong primary user in 
a non-causal way, and use this knowledge to increase its 
data rates. However, there is a subtle difference between 
knowing the signal transmitted by the primary and the 
actual interference at our receiver since there is a wireless 
channel between these two points. We show that even an 
unknown phase results in a substantial decrease in the 
data rates that can be achieved, and thus there is a need 
to feedback interference channel estimates to the cognitive 
transmitter. We then consider the case of fading channels. 
We derive an upper bound on the rate for given outage 
error probability for faded dirt. We give a scheme that 
uses appropriate "training" to obtain such estimates and 
quantify this scheme's required overhead as a function of 
the relevant coherence time and interference power. 

I. Introduction 

Knowledge of different aspects of a wireless chan- 
nel can be exploited in various ways to increase the 
achievable data rates. For example, consider an OFDM 
transmission when the channel has frequency selective 
fading. The receiver needs to know the channel in order 
to do equalization and thereby interpret the information 
on the various subcarriers. To support this equalization, 
the transmitter helps by dedicating some of its energy 
to transmitting known pilot tones [1]. As the bandwidth 
gets large and the number of fading parameters increases, 
the overhead required to learn these parameters at the 
receiver increases and it has been shown [2], [3] that 
the optimal signaling in wideband communications is 
'peaky' — concentrating most of its energy in a few 
time/frequency slots. 

The receiver clearly needs to implicitly or explicitly 
learn the wireless channel, but in many cases, the system 
can benefit from having the transmitter exploit this 
knowledge as well. If the channel is frequency selective, 
then the transmitter can use the simple water-pouring 



scheme [1] to allocate its power across subchannels to 
achieve optimal rates. [4] extends this insight to practical 
settings where the signal constellations are constrained. 
If the fading is not frequency selective, then for a single- 
input single-output channel, transmitter knowledge does 
not really impact waveform design. 

In practice, channel knowledge is never perfect. Chan- 
nel uncertainty at the receiver comes from the finite 
underlying coherence time of the time-varying wireless 
channel. In wireless systems that time-share a single 
frequency between forward and backward links, wireless 
reciprocity induces a similar channel uncertainty at the 
transmitter. More typically, the limitation in channel 
knowledge at the transmitter comes from the quality of 
the feedback link over which the channel information 
is sent back. Such uncertainty in knowledge of the 
channel is known to result in appreciable degradation in 
performance [5], [6], [7]. Even for fiat fading, [5] shows 
that for the MIMO broadcast channel, fading uncertainty 
reduces the throughput substantially at high SNR. 

This paper explores the impact of channel uncertainty 
in the cognitive radio context. Generally, a cognitive 
radio senses its environment, learns from it, and oppor- 
tunistically uses the channel resources. While in many 
cases, the focus is on finding empty bands in which to 
transmit, [8], [9], [10] suggest a radically different per- 
spective. They suggest that very strong interference from 
a primary user can be exploited by a communication 
system to increase its achievable rate beyond what is 
possible by simply treating the interference as noise. 

If the receiver perfectly knows the interfering signal, 
it can simply subtract it off from the received signal and 
thereby pretend that there is no interference. It makes 
no difference if the knowledge is delayed, instantaneous, 
or non-causal. The idea that noncausal knowledge could 
be useful at the transmitter is attributed to Gelfand and 
Pinsker [11]. In a surprising result, Costa [12] proved 
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Fig. 1. A cognitive radio cognizant of signal transmitter by the 
primary. For simplicity, we assume thiat tiie interferer's cliannel results 
in a phase shift, e-'". 



that the capacity of a channel with additive Gaussian 
interference (known non-causally at the transmitter) is 
same as that with no interference at all. This is called 
"dirty paper coding" and it forms the basis for the 
schemes in [8], [9], [10] where it is assumed that the 
transmitter is able to decode what the primary is sending 
before it begins its own transmissions. 

Fig. [T] illustrates the setup. Once again, it is natural to 
consider uncertainty in the knowledge of fading. After 
all, there is a difference between knowing the transmitted 
primary signal and knowing the interference caused by 
this signal at our own receiver. We consider the case of 
flat fading and concentrate on the unknown phase shift 
for simplicity. We pose the following questions: 

• How important is it for the secondary transmitter 
to have precise interference phase knowledge? 

• How does the fade amplitude affect the achievable 
rates? 

• Does feedback help and if so, how should we use 
it? 

The problem of uncertainty in knowledge of the inter- 
ference at the transmitter has been addressed in [13][14]. 
[13] models interference uncertainty by allowing the 
channel interference vector to take one of two possible 
values, both known at the transmitter only. The two 
possible interference vectors are drawn independently 
with Gaussian distributions. The transmitter does not 
know which particular value the interference assumes. 
[13] shows that situation is equivalent to a broadcast 
channel model with a common message, where the 



transmitter knows the interference vectors to the two 
receivers. They are able to show an explicit rate penalty 
for not knowing the interference exactly. For the scenario 
in Fig. [H however, the model in [13] is not realistic: 
the possible realizations of the interference vectors are 
far from independent. It is unclear from [13] whether 
phase-uncertainty alone would impose a significant rate 
penalty. 

[14] considers the general problem of uncertainty in 
knowledge of interference, modeling it as a compound 
Gelfand-Pinsker problem. This model does capture the 
scenario in Fig. [T] and gives the following lower bound 
Ci and upper bound Cu on the capacity C. 

Ci= sup mi[I^{U;Y\W)-I{U;S\W))] 

Pu\x,s,w,Px\s.w,Pw f^'^^ 

(1) 

Cu= sup inf[ sup I^{U;Y\W)-I{U;S\W))] 

Px\s,w,Pw ^^'^ Pu\x,s,w 

(2) 

where /5 G C is a parameter characterizing the uncer- 
tainty in the knowledge of state S. Notice that both 
bounds are expressed as supremums over the distribution 
of an auxiliary random variable U. Evaluated for any 
particular U, ^ gives a lower bound to the capacity. 
However, any particular U does not give a valid upper 
bound in ([2]). That requires taking a supremum over all 
possible C/'s, and it is not clear from [14] what the 
appropriate choice of U is. Without such guidance, the 
upper bound Q is not computable for even a simple 
wireless channel model. 

In addition to the bounds, a scheme is suggested in 
[14] for flat fading channels. The scheme uses Costa's 
dirty-paper strategy, where the auxiliary random variable 
U is defined to be equal to X + aS. A seemingly reason- 
able value of Q is chosen, according to the distributions 
of the fade parameters. Unfortunately, the constant a be- 
comes zero if the fading coefficient has zero mean. Since 
Rician fading has a non-zero mean, [14] concentrates on 
that case. For simple phase uncertainty, the strategy is 
not optimal since a = implies ignoring the interference 
knowledge at the transmitter. In Section |V] we show a 
scheme (inspired by [13]) that performs better at low 
SINR. 

Sacrificing generality, we obtain stronger bounds than 
[14] for particular case of phase uncertainty. Section JV] 
shows that the lack of phase knowledge can substantially 
reduce the attainable rates. This is done by modifying the 
bounding strategy in [13]. Section [Vll considers the case 
when low-rate feedback is available from the receiver. 
We show that if the transmitter encodes certain "training 
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data", an asymptotically vanishing rate-loss is incurred 
by having to learn the phase to some fidelity and and 
communicate it back to the transmitter. For the residual 
uncertainty in phase, we give a modified dirty-paper 
coding scheme that achieves the capacity for perfect 
phase knowledge in the limit of uncertainty going to 
zero. 

We then use the de consider the problem of fading 
uncertainty regarding the primary's transmission, where 
the magnitude of the fade is also uncertain. In the 
classical point-to-point communication problem, a deep 
channel fade necessarily introduces errors, even with 
complete knowledge of the fading coefficient. Therefore, 
the problem of interest there is finding the achievable 
rates for given outage probability. However, here we 
consider fading only for the primary's signal, which is 
really the interference to the secondary. With complete 
knowledge of the fading coefficient, the problem reduces 
to Costa's dirty-paper coding problem [12] and there is 
no channel outage due to known interference fading. 

Consider Rayleigh fading of the primary's signal. Sup- 
pose our scheme ignores the knowledge of the primary's 
transmission. The magnitude of the fading coefficient 
of the primary's transmission can be arbitrarily large. 
Hence, the interference power can be arbitrarily large 
too. The SINR, and hence rate, achieved by this scheme 
would therefore be zero! This suggests considering 
achievable rates for a given outage probability. On the 
other hand, the trivial upper bound, corresponding to 
complete knowledge of the fading coefficient, suggests 
that outage probability does not change the rate signif- 
icantly. Therefore, it is not clear if allowing for some 
outage probability would make a significant difference 
to the achievable rate. 

From a system design perspective, we conclude that in 
designing systems that exploit non-causal knowledge of 
the interference, feedback has an important role to play. 
Furthermore, the natural "broadcast nature" of wireless 
transmissions will be destroyed since the transmitted data 
is targeted at a particular phase of the interference. This 
means that schemes like [15] will not function in this 
environment. 

11. Problem Statement 
Consider the following complex channel model. 

Yi = Xi + hiSi + Zi (3) 

where 1^ denotes the received signal at the secondary 
receiver at the time instant i, Xi denotes the signal 
transmitted by the secondary transmitter. Si denotes the 



signal of the primary, hi is the channel fade coefficient, 
and Zi is AWGN. We are interested in maximizing the 
rate for the secondary. 

For simplicity, we first assume that the fade amphtude, 
\hi\ = 1, and the phase of hi is constant 9, independent 
of i. Also, we assume block encoding and decoding, with 
block-length n. Therefore, the channel model is 

Y" = X" + S"e^'^ + (4) 

where the superscript n denotes an n— length vector. The 
interference is modeled as (complex) Gaussian i.i.d at 
each instant, with variance Q and mean zero. The signal 
X is power constrained, in that the average power should 
not exceed P. The noise is AWGN, and without loss of 
generahty, its variance is assumed to be 1. Under this 
model, we seek to find the maximum achievable rate 
under no knowledge of the phase 6 at the secondary 
transmitter. 

We then consider the case when the fade amphtude 
\hi\ =7 and the phase in again unknown. The channel 
model now is 

yn ^ X" + S"e^'^ + Z" (5) 

We want to find the achievable rate region for unknown 
7 and 6. 

In what follows, W denotes the message to be com- 
municated by the cognitive radio. W is assumed to be 
chosen uniformly from the set of 2"^ messages. 

111. Upper bound on the communication rate 
under phase uncertainty of the interference 

In this section, we give an upper bound on the com- 
munication rate assuming the phase of the interference 
is unknown at the transmitter. 

In [13], the authors found bounds on the rate assuming 
that interference vector lies in a two-point set known 
to the transmitter. They observed that the problem of 
unknown interference is the same as that for communi- 
cating the same message simultaneously to two receivers, 
where the receivers face different interference vectors. 
We build on their idea. 

Consider the case when the transmitter is unsure 
whether the phase at the receiver is or cj). This un- 
certainty is certainly no worse than no knowledge of 
the phase. Hence, any upper bound for this two-point 
uncertainty set is an upper bound for our case of no 
phase knowledge. 

Suppose the uncertainty is between S" and S'^e^'^, 
for some fixed (j) < tt. Denote by Y" the output 
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Fig. 2. Bounds on the rate for interference power Q — 2db. The 
bound shows that at moderate SIR, ignoring interference and treating 
it as noise is close to optimal if we do not know the phase of the 
interference at the receiver. 
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Fig. 3. Bounds on the rate for Q=5db. At low SIR, our new bound 
becomes very loose — going above the rate with zero interference. 



corresponding to the first user and as tiie output 
corresponding to tlie second user. Tlien, 



Y^ = X" + S"e^''^ + 



(6) 



For this model, the following upper bound holds: 

Theorem 1: The rate for reliable communication for 
the channel model described in Section |ll] is bounded by 



R<-\oi 
- 2 ^ 



{P + Q + l) 



21 



4g 



(7) 



Proof: See Appendix Jl 



As an aside, we note that the bounding technique 
here can also be used to find bounds on the rate for 
uncorrected interference vectors of [13]. The bounds 
obtained are tighter than the bounds in [13] at high SIR. 

The bound for rate is plotted as a function of transmit 
power P for interference power Q = 2db in Fig. |2j for 
Q = 5db in Fig. [3] and for Q = Ibdb in Fig. H At 
low Q 2db, the bound is close to the rate achieved 
by ignoring the interference knowledge completely and 
treating it as noise. This shows that the bound is quite 
tight at low Q and that missing a single bit of phase- 
knowledge, makes the rest of the transmitter's knowledge 
of the interference almost useless! For increasing Q, the 
bound is seen to get looser. In fact, it is evident from 
Fig. |4] that the bound is extremely loose at low P and 
large Q since it goes above the 'no-interference' bound. 
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Fig. 4. The plot shows the bounds on the rate for Q = 15db. 
Our upper bound is quite loose at low signal powers. Even at high 
signal powers, it is unclear if the bound is tight because we have no 
achievable schemes that approach this performance. 



IV. Upper bound on the outage capacity 

UNDER FADE UNCERTAINTY OF THE INTERFERENCE 

In this section, we give an upper bound on the 
communication rate assuming the fading coefficient of 
the interference is unknown at the transmitter 

Assume first that the fade amplitude 7 is fixed. We 
obtain computable upper bounds on the rate for given 
7, using Theorem 1. Suppose the transmitter commits 
to a transmission rate R. We find the probability that 
the upper bound coiTcsponding to the fade amplitude is 
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Sectors 



Fig. 5. The plot shows the idea behind the sectoring strategy. The 
phase-space is divided into fc = 8 sectors, and the transmitter dirty- 
paper codes according to the central vector in each sector, S. The 
actual interference is 5", and the phase difference is A(/) 



below this rate. This gives a lower bound on the outage 
probability. 

We denote the capacity of the channel for given fixed 
7 by C(7)- 

For this model, the following upper bound holds: 
Corollary 1: The rate for reliable communication for 

the channel model described in Section |ll] with fixed 

fading coefficient 7 is bounded b)Q 



i? < - loe 

- 2 ^ 



472 Q 



The bound is denoted by Cu{'y)- 
Proof: Follows from Theorem 1. 



(8) 



□ 



Corollary 2: Assuming that the transmitter now com- 
mits to a rate R, the outage probability is bounded by 



Pout > Pr(C47) < R) 



(9) 



Proof: The bound follows immediately from the fact 
that Pout > Pr(C(7) < R) and C{j) < Cu{-/). 

The bound on outage probability depends on the fade 
distribution through the random variable C„(7). 

The bound on the rate is plotted as a function of the 

'As an aside, we note that the bounding technique here can also be 
used to find bounds on the rate for uncorrelated interference vectors 
of [13]. The bounds obtained are tighter than the bounds in [13] at 
high SIR. 



fade amplitude ^ for P = Q = 10 db in Fig. [6] The lower 
figure in Fig. [6] is the pdf of Rayleigh distribution for the 
parameter = 1. For fixed rate R = 2 bits/symbol, the 
probability Pr(i? > C„(7)) is the area of the shaded 
region in Fig. [6] This is a lower bound on the outage 
probability. Alternatively, for given outage probability, 
the curve gives an upper bound on the achievable rate. 

The bound on the rate in Fig. |6] has a surprising 
feature: for large 7, it increases on increasing 7. We 
think this is an artifact of the bounding technique used, 
and is not fundamental to the problem at hand. 

The bound thus obtained is plotted as a function of 
outage probability for various Rayleigh parameters in 
Fig. |7J and as a function of Rayleigh parameter for 
outage probability of 0.1 in Fig. [8] Since the bound in 
Fig. |6] is loose for large 7, the same is expected for the 
bound in Fig. [8] for large a'^. 

V. A SCHEME FOR ACHIEVING IMPROVED RATES 
WITHOUT PHASE KNOWLEDGE 

The bound above raises a natural question. Is it at all 
useful to have the interference knowledge, without the 
phase? The curves in Fig. |3] suggest that our bound is 
loose at low SIR, therefore, it is likely that the greatest 
advantage lies there. 

At low SIR, a natural strategy is to break the in- 
terference uncertainty 'circle' into a few sectors. Fol- 
lowing [13], the transmitter time shares between coding 
strategies that dirty paper code according the 'central' 
interference vector in each sector. We call this strategy 
'sectoring'. The strategy is illustrated in |5] To analyze 
such a strategy, however, requires us to find the impact 
of small phase uncertainty on the achievable rates. The 
problem is non-trivial because the residual interference 
is not independent of the interference vector, and hence 
not independent of the codeword to be detected. 

A. Achievable rates with small uncertainty in the phase 

Assuming that the uncertainty in phase is a small value 
A(/), we investigate achievable rates for this reduced 
uncertainty. We wish to dirty paper code with respect 
to the 'central' interference vector in the uncertainty set, 
and treat the 'residual' uncertainty as noise. 

Assume that the the phase </) G [— Ac/;, A0] for some 
< A0 < ^. The transmitter chooses the codeword U" 
according to the usual rule U" = X" + aS" , where X" 
is chosen independent of S" as in [12]. The value of a 
depends on A0, as well as the SNR. The output Y" is 
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Fig. 6. The upper figure gives the upper bound on the rate for P = 
Q = lOdb and varying fade coefficient 7. Also plotted are the rates 
with perfect knowledge of the interference (the classical dirty-paper 
coding) and the rate ignoring the knowledge of the interference. The 
lower figure is the pdf of Rayleigh distribution for — 1. Assuming 
the transmission rate to be i? = 2 bits/symbol, the calculation of 
lower bound on outage probability is shown. The lower bound is the 
area of the shaded region in the lower figure. The slowest increase 
in the rate with increase in outage probability would be when the 
peak of Rayleigh distribution lies near the valley of the bound. A 
surprising aspect of the upper bound is that for large 7, it is an 
increasing function of 7. We think this is an artifact of our bounding 
technique, and is not the actual behavior of the capacity. 



given by 



^ X^ + SV'^ + Z" (10) 
= U" - aS" + S"e-'<^ + Z" (11) 



Since X" and S" are independent, U" is jointly Gaus- 
sian with S". Therefore, U" and Y" are also jointly 
GaussiaiH- 

We prove the following: 

Theorem 2: For a phase uncertainty of Ac/), an achiev- 

^Any linear combination of these two random variables is jointly 
Gaussian 
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Fig. 7. The upper bound on the achievable rate vs the outage 

- probability for varying parameter of Rayleigh distribution for P = 
Q = 10 dh. From Fig. [6] it is evident that the slowest increase 
in outage probability would be when the mean of the Rayleigh 

- distribution lies in the interval. This figure demonstrates the same 
explicitly. The capacity of channel with no uncertainty in 7 is ~ 3.46 

lobits/symbol. The outage probability is, therefore, 1 above this value. 
We think that the bound in Fig. |6] is loose at high SIR. Therefore, 
the bounds for large values of cr^ are also loose. Observe that all the 
curves converge to the same value for low outage probability. This 
value is the same as the minimum of the upper bound in Fig. [S] This 
is because the bound for outage probability of zero is a bound to the 
achievable rate for all values of 7. 
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Fig. 8. The upper bound on the achievable rate vs the Rayleigh 
parameter cr^ for P — Q = 10 db and an outage probability of 0.1. 
We think that the bound in Fig. |6] is loose at high SIR. Therefore, 
the bound is loose at large . We think that the outage capacity is 
a decreasing function of . 
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p 



(12) 



able rate is given by 

R = sup log , - I 
where 

£iA<p) = (1 - /3(A0))2p + (a2 + ^(A0)2 
-2a/3(A0) cos((/.))Q + /3(A<^)2 

and Pie) := 

Proof: See Appendix HIl 

S. Rates achieved by sectoring 

Suppose we divide the circle of uncertainty into k 
sectors, each forming an angle of Then we time-share 
dirty paper code with respect to the central interference 
vector in each sector. Intuitively, time sharing costs us 
a factors of k in rate. However, for the best sectors, 
uncertainty in phase is at worst j. 

The rate achieved by sectoring is therefore given by 



p 



(13) 



To get an intuition into the achievable rates, we do 
some approximate calculations. The residual uncertainty 
is (approximately) Q sin^(7r/A;). At low SIR, 

(14, 



R 



1 



k 



log 1 + 



kQir^ 



l + Qsm'^{ir/k)^ 

if Qsin^iiT/k) >> 1. The optimal k can be determined 
by maximizing the rate over k. Intuitively, if P < 1 and 
Q » 1, then we should keep increasing number of 
sectors till Qir'^/k'^ 1 (residual interference becomes 
comparable to 1). For this value of k, the rate obtained 
is 0(^-^V which is a gain of ^/Q. If Q » P > I, 
then we should keep increasing number of sectors till 
Qir'^/k'^ PS P (the log approximation fails at this point). 
The resulting rate is O y'^) ' ^ smaller advantage than 
for P < 1. 

We plot the (low SIR) region in which the sectoring 
method proposed here performs better than ignoring 
interference as noise (see Fig. |9ll. Fig. [TO] shows the 
optimal number of sectors with P for fixed interference 
power of 25 db. As expected, an advantage is obtained 
only in extremely low SIR region. 

VI. Performance with feedback: phase 

ESTIMATION 

In Appendix [III we derived a lower bound on rate for 
some uncertainty in the phase. This can happen if the 
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Fig. 9. Plotted on log scale, the difference between the two curves 
converges to a constant at low SIR. Thereby, the ratio of the two 
rates converges to a constant as SIR decreases. 
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Fig. 10. Number of sectors required to achieve optimal performance 
decreases as SIR increases. sectors corresponds to ignoring inter- 
ference as noise. Q is again maintained at 25db. 



transmitter gets some feedback from the receiver about 
phase estimate. This avoids the need to waste energy on 
transmitting for all possible phase sectors. 

Traditionally, the problem of phase estimation is over- 
come by the transmission of some pilot signal, which 
the receiver already knows. However, it is not reason- 
able to assume that the primary transmitter has a pilot 
signal known to the secondary receiver. Therefore, the 
receiver cannot estimate the phase without the secondary 
transmitter's help, since it does not know the signal 
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transmitted by the interferes 

Since the interference phase uncertainty is zero-rate 
— it does not grow with time — it is natural to ask 
if the receiver can estimate the phase and convey it to 
the transmitter at an asymptotically vanishing cost. In 
this section, we provide some ideas for phase estimation, 
and give a scheme to combat interference in the face of 
residual small uncertainty. 

A. Strategy for phase estimation 

Consider the following simple strategy to estimate 
phase. During time instants {1,2,..., k} (call this the 
"preamble" in analogy with training in traditional sys- 
tems), the transmitter transfers its 'prescient knowledge' 
of the S^^^~^ interference symbols by appropriately 
compressing them and encoding the compressed data 
to send over the channel. For this transmission, the 
receiver treats the interference as Gaussian noise of 
power Q. Now, the receiver estimates the phase during 
the first r time instants, reducing the uncertainty of 
phase to Ac/), which depends on channel noise as well 
as error introduced in quantization. The receiver then 
feeds back the phase estimate to the transmitterj^] Finally, 
the transmitter encodes the actual data payload for the 
remainder of the time. The transmission ends at the end 
of a packet or the end of the coherence time, whichever 
comes first. 

The above strategy is well suited for a packet-oriented 
communication scheme where the time between pack- 
ets is potentially many channel coherence times and 
unknown to the transmitter in advance. There is not 
much we can do in such scenarios since the architecture 
dictates that every packet must be self-sufficient. The 
price of this self-sufficiency is that the "preamble" faces 
huge noise levels, since the interference is unknown 
while decoding it. This imposes a significant rate penalty, 
particularly under high SIR conditions. 

The way around this problem is to assume a more 
continuously streaming communication model in which 
data will be transmitted block-after-block for many con- 
tiguous coherence times. In such cases, the block-fading 
model here is questionable and a Gauss-Markov model 
is probably more appropriate. However, inspired by [3], 

^The analogy may be more accurate with the RTS/CTS messages 
in many wireless protocols. The "training packet" plays the role of 
the RTS. The receiver responds with a small "phase packet" giving 
the interference phase to use — in analogy with a CTS message. 
Only then can the main data transmission proceed. The difference is 
that here, this is a PHY-level consideration since it impacts waveform 
design rather than a MAC-level issue. 



we hope that channel coherence issues are qualitatively 
the same in the two cases and stick with our block-fading 
model. 

Two stages of transmission are introduced. During 
the initialization stage, a "preamble" as above is used. 
But once the transmitter acquires the phase to a certain 
degree of accuracy, it dirty-paper-codes the data as well 
as prescient knowledge of the first r symbols of the next 
coherence time period and sends them to the receiver. 
In effect, the preamble's payload is combined with the 
main data payload. This is possible because unlike a pilot 
tone or a PRN-sequence used in a traditional preamble, 
we are not interested in the waveform properties of 
the preamble transmission. The relevant waveform for 
sounding out the interference channel is provided by the 
primary transmission — our only goal is to empower the 
receiver to utilize it. 

At the start of next coherence time, the decoder 
estimates the phase using the first r time instants and 
feeds this information back to the transmitter. Once this 
has happened, the process repeats with the next data 
block combined with the next preamble payload. The 
strategy is figuratively illustrated in Fig. [TT] Figure [12] 
(top curve) demonstrates the performance of this strategy 
with coherence time for Q = IQdb. The underlying 
tension is between obtaining a good estimate for the 
phase to improve data rate and wasting valuable time 
while calculating that estimate. A higher value of inter- 
ference allows for better estimation for the same dead- 
zone length. Therefore, the effective rate for increasing 
Q does not show a significant change. 

The strategy is detailed and analyzed in Appendix [Till 
The analysis is approximate because we assume the 
residual phase error to be small. For small values of 
the interference power Q, the receiver may as well 
ignore interference altogether, rather than spending time 
learning it. In that case, the phase uncertainty is vr, and 
the analysis is not valid. Therefore, the analysis holds 
only for large Q. The accuracy can be estimated from 
the phase value that attains the maximum rate. 

Sometimes the transmitter may not know when its next 
transmission would be. This situation can occur when the 
source generates data at random times, unknown to the 
transmitter in advance. In this case, the initialization step 
needs to be performed for each packet, and therefore, 
the prescient knowledge has to be decoded treating all 
interference as noise. After decoding prescient knowl- 
edge, the transmitter keeps silent for the 'dead zone', 
as the receiver uses the prescient knowledge to estimate 
the phase. The receiver then feeds the phase back, and 
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Fig. 11. Bootstrap strategy for phase estimation. The dead-zone 
allows for phase estimation. Transmitter sends some prescient knowl- 
edge of the interference to the receiver during the previous data block. 
The receiver uses this knowledge to estimate phase during the dead- 
zone. The process is repeated in next coherence time. 



transmitter dirty-paper codes in accordance with the 
phase knowledge, as before. 

This scheme is also analyzed at the end of Ap- 
pendix [nil The performance of this scheme is compared 
with that of the earlier strategy in Fig. [121 

VII. Conclusions and future work 

We showed that the dirty paper coding strategy re- 
quires the knowledge of the interference phase at the 
transmitter. At high SIR, there is a significant loss in 
performance due to not knowing the phase. At extremely 
low SIR, we provided a strategy that is able to utilize 
the knowledge of the interferer's signal to some extent 
without having to get an estimate of the phase. We 
showed that if substantial gains are desired, then it is 
worthwhile to engineer systems with feedback where the 
transmitter also spends some of its power in helping the 
receiver estimate the interference channel. 

There are many open problems here: 

• The achievable schemes need to be tightened for 
both the case of no phase knowledge as well as 
when feedback is available. 

• The sensitivity for multiple fading parameters be- 
yond just phase needs to be established. Does every 
additional uncertainty impose its own overhead or 
does the case of phase uncertainty already capture 
all the essential effects? 

• The upper bounds on the rate need to be tightened. 
The current bound is useless at low SINR and the 
current two-point uncertainty bound is also unable 
to adequately limit performance when the interferer 
is strong. 

• The results need to be extended to Gauss-Markov 
models for fading where the variation is more con- 
tinuous than the block-fading model here assumes. 
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Fig. 12. Variation of effective rate achieved for P = IQdb for bursty 
and non-bursty (contiguous time slots) transmissions. The bursty 
transmission results in significant loss of rate, with loss increasing 
for higher interference power(full phase knowledge Capacity-^ 3.46 
bits/symbol). For contiguous transmissions, increased coherence time 
allows for longer dead-zone with little loss in the rate. The variation 
of effective rate with Q, however, is not significant if we assume 
that the communication proceeds over contiguous coherent times 
since the data never really sees the interference. For the same 
relative fidelity of training quantization, a higher Q allows for better 
phase estimation in the dead-zone, thereby reducing the effective 
interference indirectly in proportion to its increase. This assumes that 
the higher resolution feedback message does not significantly tax the 
feedback link capacity. 



In the end, the results in this paper are "negative 
results" that suggest that attempts to exploit transmitter 
knowledge of the interference signal realization will 
encounter engineering difficulties. 

Appendix I 
Proof of Theorem 1 

Following [13], we assume that the transmitter has 
phase uncertainty in the two point set {0, vr}. That is, the 
transmitter knows that the interference vectors is either 
S" or -S". 

Remember that our model is 



Yn 
2 



X" + S" + 7/1 
X" - S" + 



(15) 



Observe that the performance at any receiver is deter- 
mined only by the marginal distribution of the noise 
vectors at that receiver. We are free to introduce any 
joint distribution on the noise vectors. For finding the 
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tightest upper bound, we assume Z" = Zj, that is, their 
correlation is 1. 



In [13], the interference vector lies in a two-point set 
of statistically independent vectors. In the bounding tech- 
nique there, one of the interference vectors is assumed 
to be zero. Using common randomness between the 
receiver and the transmitter, they show that this situation 
yields higher capacity than independent interferences of 
the same power. We note that for our case, the method 
in [13] to upper bound the capacity does not work. This 
is because the common randomness argument hinges on 
ability to generate random sequences at the transmitter 
and the receiver that simulate the interference ([13]). 
For a phase shifted interference vector, such sequence 
generation is not possible at the receiving end since it 
does not know the interference realization. A simple 
way of seeing this is where = 0. In this case, the 
usual dirty-paper coding scheme is optimal. However, 
assuming one of the possible interference vectors to be 
zero would decrease the rate beyond that! 

Since the transmitter is able to communicate reliably 
for either of these two phases, the cut-set bound and 
Fano's inequality [16] give us: 



nR < I{W;Y'l)+ne 



And therefore. 



nR < mm I {W ■,Yf ) +ne 



(16) 



(17) 



To avoid clutter, we drop the e's in the following steps. 
From (ITT] ). 

I{W-Y^)+I{W;Y^) 
nR< 

= ^{h{Y^) - h{Y^\W) + h{Y^) - h{Yl^\W)) 

< ^(/i(Y^) + /i(Y^)-/i(Y^',Y2"|VF)) (18) 

Suppose the correlation between Xi and Sk,i is pki for 
k = 1,2. We observe that for random variables of given 
second moment, the Gaussian random variable has the 
maximum entropy. Thus, 

n 

/i(Y^) < ^log(27re(P, + Q + 2/5fc,V^+^)) (19) 



i=l 



where Pi is the power of the i*^ transmitted symbol, X., 



Also, 



/i(Y^,Y^|iy) = h 



^1 ~ ^2 



V2 



W]+h 



Yf + YJ Yf-Y^ 
Yf + Y^ 



V2 



V2 



w 



■■^ 2 



V2 



(20) 



Since Yf — Y2 is independent of X" (and hence, of 



W), the first term in ([20] ) is easy to simplify 



h 



W 



V2 

where we use Z" = Z2. 



nlog(27re(5(l — cosvr)) 
nlog(47reQ) (21) 



The second term can be lower bounded as follows 



h 



Y'^ + Y^ 



> h 



Y'^ + Y^ 



V2 



V2 
W, 



' 2 



V2 



V2 



(22) 



Notice that Yf + Y^ = 2X" + 2Z?^. Therefore, 



Yf + YJ 



> K 



2Z" 

nlog(27re * 2iV) 



Using ([Sob , we can now lower bound the entropy 

h{Y'^,Y^\W) > nlog(47reQ) 
-l-n log(47reiV) 



(23) 



(24) 



Now in ([H), it is sufficient to bound /i(Yf ) + /i(Y^). 
Using 

hiY^) + hiYl^) 



< Ek=i,2 Ei log[2vre(P, + Q + 2pkiVm + N)] 
= Efc=i,2 log[2vre(P, + Q + 2puVm + N)] 

< 2 Y.i log[2^e(Pi + Q + {pu + p2i)V^ + N)] 



2Eilog[27re(P, + g + Ar)] 



< 2nlog[27re(P + Q + 7V)] 

Inequalities (a) and(c) follow from concavity of log(-) 
function. (6) follows from the fact that pu = —p2i- From 
(l24l ) and (|25] ). we get the following bound on the rate: 



R< - log 
- 2 ^ 



jp + Q + Ny 

4QN 



(25) 



□ 
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Appendix II 

Derivation of lower bound for partial phase 
knowledge 

A. Analysis for (j) = Ac/) 

In the following, we use the techniques developed in 
[1, Pg. 530-532] and [17]. 

We first detail the encoding and decoding scheme, and 
then analyse them for finding achievable rates. 

Generate a Gaussian [/— codebook, where each ele- 
ment of each codeword is drawn i.i.d. CJ\f{0, P + a^Q). 



Generate Ni 



u 



n(l-e) 



such codewords to form 



the codebook. Distribute them randomly into 2"^ bins. 
At the encoder, given message m G {1, . . . , 2^"^^ } 
and S", find U" that is jointly typical with S". Send 
X" = U" - aS". 

At the decoder, find the Linear Least Square Estimate 
(LLSE) of U" given Y" = X" + S" + Z", assuming 
is known, and its value is A0. Let the average LLSE 
error in each direction be denoted by £. Construct a 
sphere of radius n(l + 5)£ {5 > 0, small) around U". 
Decode to the [/—codeword in the sphere, if a unique 
such codeword exists. Else, declare a decoding error. 
We now proceed to analyse the above scheme. 
An encoding error would happen if for given S" and 
the message m, there is no jointly typical pair (U", S") 
in bin m. 

The probability that a pair (U", S") is jointly typical, 
where each of the elements is drawn independently, is 
greater than (1 — g)2~"^(^''^)~"'^ for sufficiently large 
n. Therefore, the expected number of jointly typical 
codewords in a bin is greater than (1 — e)2"'^ if 

iV„ X (1 - e)2^riI{U;S)-ne ^ ^^nR (26) 

has a positive exponent. If this condition is satisfied, then 
as in [17], the probability of error converges to zero. 
The expression I{U; S) simplifies to 



I{U;S) 



h{U) - h{U\S) 
h{U) - h{X) 



Therefore, for a positive exponent in (1261 ). 

P" 



R < log 



(27) 



There are two potential sources of decoding error. The 
transmitted codeword may not lie in Sjj, or there may 
be another codeword U" in Su- 



Let us first find the LLSE error £. The LLSE estimate 
gives us: 

Re[E[U*Y]]^ 



[/,; 



E\Y\^ 
P + a cos{A<f>)Q 



Y (28) 

Yi =: l3{A(^)Yi (29) 



P + Q + l 
where the LLSE error is given by 

£ = E[\U^-Ui\^] 

= E[\Xi + aS^ - ^{Xi + S.e^'^'^ + Z,)\^] 

= E[\{l-p)Xi + {a-Pe^^'f')Si-pZ,\^] 

= (1 - pfP + (a^ + - 2ap cos(A(/)))Q + p'^ 

The mean of the cross terms is zero, since by the dirty- 
paper-coding construction, Xi is approximately uncorre- 
lated with Si. In each of the real and the imaginary axes, 
the LSE is £/2. 

By the weak law of large numbers, with high proba- 
bility, the transmitted U— codeword falls in the sphere 
Su of radius n(l + e)V£ centered at U". To find the 
achievable rate, we now need to bound the probability 
of some other codeword U" falling in Si/. 

A Gaussian codebook of power P + a^Q can be 
generated in an alternative way. Choose any radial direc- 
tion randomly in the n— dimensional space. Then choose 
a point in this radial direction with the appropriate 
conditional distribution. This construction works because 
Gaussian distribution has uniform distribution over any 
radial direction. 

Another such distribution is uniform distribution over 
a sphere of radius n{P + a'^Q) in n— dimensional. To 
generate a codebook with distribution uniform over the 
whole sphere, choose a radial direction randomly, and 
then choose a point in the radial direction with the 
appropriate conditional distributior0- 

As n ^ oo, both the distributions concentrate towards 
the edge of the sphere. For gaussian codebook this is 
because of the weak law of large numbers, whereas 
for uniform codebook, it is because the volume in 
large dimensions concentrates around the surface of the 
sphere. For n large enough, conditioned on choosing the 
same radial direction, distance of a typical point on the 
radial direction chosen according to Gaussian codebook 
distribution from a typical point chosen according to 
uniform codebook distribution can be made smaller than 
ne for any e > for n large enough. Effectively, 
codewords of both the codebooks can be made arbitrarily 

^which is not the uniform distribution in tlie radial direction 
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close to each other by choosing n large enough. 

Therefore, the decoder can safely assume that the 
codewords were chosen uniformly. It decodes to the uni- 
form codebook codeword, which is close to the gaussian 
codeword for the same message. With that assumption, 
the probabiUty that some other codeword U" falls in the 
sphere Su is 



VoliSu) 



Vol{U - space) \\^P-\-a 



2n 



e 

2 



v2Q 



(30) 



Using the union bound, the probability that any other 
codeword falls in Su converges to zero as long as the 
number of [/—codewords is no greater than A'^^ = 

^^^''^'■'^or somee>0. 



Observe that the proof is fairly general. In particular, 
the proof also works for jointly gaussian X, S, Z with 
arbitrary correlations. The uniform codebook argument 
is brought in to use the volume arguments. The gaussian 
codebook is necessary for using typicality arguments. 



B. Analysis for (p < A(p 

In absence of knowledge of (j) at the receiver, we 
perform the same operations as for cp = Acp. The mean 
squared error now is 

£^ = E[\Ui - U^\''] 
= E[\Xi + aSi - p{Xi + Sie^'t' + Zi)\^] 
= E[il - f3{A<p)f\Xi\^ 
+{a^ + /3(A0)2 - 2a/3(A0) cos{^))\S,\^ 

+ cross terms] 



-2aP{Acj)) cos(0))Q + /3(A(/>)^ 



(31) 



The mean of the cross terms is zero, since by construc- 
tion, Xi is approximately uncorrelated with Si. 

Since |0| < A0, the mean square error is only 
smaller. Therefore, for < A(/) and for n large enough, 
the nearest codeword only comes nearer, for the same 
noise realization. Therefore, the nearest codeword still 
lies in the sphere. Since rest of the codewords are 
randomly chosen, uniformly in the sphere, using the 
same argument as in III-AI the probability of any another 
[/—codeword falling into the sphere is small. 

Therefore nearest neighbor decoding achieves the rate 
given by ([27]). □ 



Appendix III 
Analysis of proposed phase estimation scheme 

Denote by S" the receiver's estimate of the interfer- 
ence vector S". The transmitter source-codes S before 
sending it to the receiver. Denote the induced distortion 
by D. Since it is obtained from the source code for S", 
it looks like (see [18, Pg. 479]) 



{Q-D)Q 



D 



(32) 
and the 



where is Gaussian with variance 
scaling factor 7 = 

For k time instants, there is no transmission (dead- 
zone, r). During this time, the receiver estimates the 
phase and then sends it back to the transmitter. The 
feedback is modeled as instantaneous since the phase 
can be encoded using a very small number of bits. 

The receiver correlates the received vector with the 
signal it receives. Let the correlation random variable at 



Ah 



instant be denoted by Tj. Then, 



T, = {jSi + QnS^e^'^ + Ni[ 



(33) 



where Ni is the noise of unit variance. Tj can be 
simplified into 

= i^lSil^^'^) 
+{jS*Ni, + C*S,e^'t' + C:Ni) (34) 

It is clear from here that the sensitivity of the estimate is 
independent of what the phase (p actually is. Therefore, 
without loss of generality, we assume cp = 0. Denote 
the second term in (|34l ) by r]i. Suppose the receiver 
estimates for r time instants. Then the error phase error 
is approximately ■ 

The probability that this phase error is greater than 
A(p is 



Pr ( ELlHl > A<P 



Then, for large r, by the central limit theorem 



E[=i m 



A/ 



(35) 



(36) 



where E[r]f] = j'^Q + + Therefore, 

Pr(^ > V^A^) - Q ( ^^^] (37) 

where Q is the familiar Q— function. Set this probability 
of outage to some fixed [5 (say, 10^'^) that reflects the 
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acceptable probability of outage. The training time to 
reach A(/) error in phase with confidence of 1 — /3 is 
given by 

The number of data bits communicated Ndata is 



^data = ™^x[ (/coh-^)C(^ 



^coh-^/ 

where /^^j^ is the coherence time, and C{P) = log(^). 
The effective rate is therefore 



^eff 



data 



xoh 



(39) 



A. Performance with bursty transmissions 



In this model, the transmitter no longer communicates 
the prescient knowledge with data bits. Instead, the 
initialization stage needs to be performed for each data 
packet, resulting in a greater decrease in the effective 
rate. During the initialization, the transmitter sends its 
prescient knowledge to the receiver with average power 
Pi for time Tp^. The receiver ignores the interferences 
completely, treating it as noise, and decodes the prescient 
knowledge. After initialization, the transmitter is silent 
for the deadzone time, and the receiver estimates phase 
during this time, and feeds it back to the transmitter. 
Then the transmitter uses this phase estimate to dirty- 
paper code for the given interference, and transmits a 
signal of average power Pi . We do not allow for different 
transmit powers for transmitting the prescient knowledge 
and the actual data. Pi is given by 



coh 



^ J- 7 

xoh ~ '^pfe ~ '''dead 
The data bits are now communicated at the rate 

'El 



(40) 



C{Pi) = log 



(41) 



We now optimize over the distortion D, and the 
dead-zone rjg^^ to obtain the largest possible number 
of bits communicated. 

^data = ™ax (^(Z^oh - ^dead " ^p^) ^ C'(^i)) 
And the effective rate is given by 



-^eff 



N. 



data 
^coh 



(42) 
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